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1 Introduction 

Lusternik-Schnirelmann category is a homotopical invariant that has been 
widely studied HlfTlll . For a topological space X, the LS category c&tX 
is defined as the minimum number (minus one) of categorical open sets 
which are needed to cover X (an open set is categorical when it is con- 
tractible in X). 

Unfortunately the LS category is very difficult to compute. For in- 
stance, while the result cat Sp(2) = 3 was proven by P. Schweitzer in 1965 
ED, it was not until 2002 that cat Sp(3) = 5 appeared in Q, see also IfTOt 
In general, the algebraic techniques involved are highly elaborated. It is 
then of interest to introduce more elementary methods. 

The main idea of this paper is to compute the LS category of some 
Lie groups and homogeneous spaces of the orthogonal type by means of 
the so-called Cayley transformation. This will give proofs — which are 
simpler than the original ones — , of cat U(n) = n (by W. Singhof H19II ). 
cat U(n)/Sp(n) — n and cat U(n)/0(ri) = n (by M. Mimura and K. Sugata 
IflHl). 

Our method is closely related to Morse theory on Lie groups. Classi- 
cally, the functions that use to be considered are "height" or "distance" as 
in Il5l l2lll22l . On a matrix Lie group G these functions are, up to a con- 
stant, of the form hx(A) — $lTr(XA), the real part of the trace, for some 
matrix X, a fact which allows to explicitly describe the Bott-Morse struc- 
ture of these functions. We shall prove that the Cayley transform serves 
to linearize the gradient flow of hx and to give local charts for the set of 
critical points. These results generalize those of K. Y. Volchenko and A. N. 
Kozachko (EH, see also EH ). 

Let us remember that in a compact manifold the LS category (plus one) 
is a lower bound for the number of critical points of any smooth function 
(Morse or not). The reason is that -roughly speaking- for each critical 
point the gradient flow defines a categorical open set. What is nice in our 
setting is that this flow is given by the contraction associated to the Cayley 
map. 

Moreover, the Morse interpretation above allows us to give an explicit 
covering of Sp(2) by four categorical open sets, a result that completes the 
abstract proof by Schweitzer IfTHH . 

At the end of the paper we explain how the generalization of our re- 
sults to the symplectic group Sp(n) depends on the computation of the so- 
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called left eigenvalues of a quaternionic matrix (25), a topic about which 
very little is known, out of the case n = 2. 

We hope that the ideas presented here will deserve further attention. 



2 The Cay ley transform 

The classical Cayley transform was introduced by A. Cayley in 1846 (3), 
as a way to express an orthogonal transformation by means of skew- 
symmetric coordinates. It is given by 

y J I + X 

This map is defined for all matrices having their eigenvalues different 
from —1 and equals its own inverse, c 2 = id. It can be thought as a gener- 
alization of the stereographic projection. 

Its basic properties appear in H7I . see also 1(231 . 

In order to obtain a categorical covering of the orthogonal Lie groups 
we shall introduce in the next paragraphs a convenient generalization of 
the classical Cayley map. 



2.1 Preliminaries 

Let the algebra IK be either R (reals), C (complex) or H (quaternions). We 
say that the matrix A e M(n, K) is orthogonal if AA* = id, where A* = A* 
is the conjugate transpose. Such a matrix can be identified with a (right) 
K-linear map K" — > K™ preserving the product (v, w) = v*w. Let us denote 
by G = 0(n, K) the Lie group of orthogonal matrices. Depending on K this 
group corresponds to the orthogonal group 0(n), the unitary group U(n) 
or the symplectic group Sp(n). 

Remark 1 The Cayley transform maps a classic orthogonal Lie group like 
G = U(n) or Sp(n) into its Lie algebra g of skew-hermitian matrices. In 
fact, suppose that A is a unitary or symplectic matrix. It can be diagonal- 
ized, A = UDU*, to a complex diagonal matrix D = diag(Ai, . . . , A„) Il2l . 
Then 

c(A) = C7diag(7r(Ai),...,7r(A„))C/*, 
where -n is the stereographic projection tt: S 1 ^-!} — > R. 

2.2 Generalized Cayley transform 

Let A e 0(n, K) be an orthogonal matrix, where K is R, C or EL 



3 



Definition 1 Let us denote by Q(A) c M(n, K) the open set of matrices 
X such that A + X is invertible. The Cayley transform centered at A is the 
map 

c A : Q{A) -► 

given by 

Cyl (X) = (/-A*X)(A + X)- 1 . 

The classical Cayley map corresponds to A = I. As we shall see in the 
next Proposition, the application ca is well denned and it is invertible, 
with c^ 1 = ca*- 

Proposition 1 If X e Q{A) then 

1. c A {X) = (A + X)- 1 (I -XA*); 

2. the inverse matrix of A* + c A (X) is (1/2)(A + X); 

3. ifX e Q{A) then c A (X) e Q{A*); 

4. ca is a diffeomorphism, with c^ 1 = ca*- 

Proof (1) It suffices to verify that (A + X)(I - A*X) = (I - XA*)(A + X), 
which is immediate because AA* = id. For (2) we compute 

(A* + A*X)(A + X)- 1 ) (1/2)(A+X) = (1/2) (A* A + A*X + 1- A*X) = I. 

Part (3) comes immediately from (2). Finally, by using (2) we obtain that 

(C4« o C A ){X) = 

(I-Ac A (X))(l/2)(A + X) = 
(I - A(I - A*X)(A + X)- 1 ) (l/2)(A + X) = 

(1/2) ((A + X) - A + AA*X) = X. 

□ 

We shall need the following interesting properties, which are easy to prove: 

Proposition 2 Let X e Q{A). Then 

1. X* e Q(A*) and c A * {X*) = c A (X)*; 

2. UXU* e Q{UAU*) for any matrix U e 0(n, K) and 

c UAU *{UXU*) = Uc A {X)U*; 

3. if the matrix X is invertible then X^ 1 e Q(A*) and 

CA-iX- 1 ) = -Ac A (X)A. 
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2.3 Categorical open sets 

The results in this paragraph show that the domain of the Cayley trans- 
form in an orthogonal group is contractible. 

The Lie algebra of G = 0(n, K) is formed by the skew-symmetric (resp. 
skew-hermitian) matrices, 

= o(n,K) = {X e M(n,K):X + X* = 0}. 

As a vector space q = TjG, so the tangent space at any other point A e G 
is 

T A G = L A (TiG) = {Y e M(n, K): A*Y + Y*A = 0}. 

Proposition 3 Let X e o(n, K) be a skew -symmetric (resp. skew-hermitian) 
matrix. Then X has not real eigenvalues different from zero. 

Proof Suppose that there exists t £ R such that Xv = vt for some v € K", 
v =/= 0. Then v*Xv = v*vt = \v\ 2 t is a real number and, consequently, 

v*Xv = (v*Xv)* = v*X*v = v*(-X)v = -v*Xv. 

Therefore v*Xv is null. i.e. \v\ 2 t = hence t = 0. □ 

Corollary 1 The real vector space T A G of the matrices Y such that A*Y + 
Y*A — is contained in (7(A) = {Y e M(n, K): A + Y is invertible}. 

Proof If A+Y is not invertible then there exists v ^ such that Yv = —Av 
so A*Yv — —v. This means that the skew-symmetric matrix A*Y has —1 
as an eigenvalue, contradicting Proposition[3l □ 

Let G = 0(n, K). We shall denote by Q G {A) the open subset (2(A) nG c 

G. 

Theorem 1 The generalized Cayley transform c A maps diffeomorphically 
(2g(A) onto T A ,G, with c A (A) = 0. As a consequence, the open set (2c(A) is 
contractible. 

The proof is an immediate consequence of Propositions Q] HI , M and 
Corollary [TJ 

3 Bott-Morse functions on Lie groups 

There is a deep relationship between the Cayley transform and Morse 
theory in Lie groups. 

In this Section [3] we prove (Proposition [4]> that one can integrate the 
gradient flow of any height function h x ■ G — > R by applying the Cayley 
transform c A , to a simple curve in T A -G, provided that c A *(0) — A is a 
critical point. As a consequence we give a local model for the set S of 
critical points of h x (Theorem©. 
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3.1 Critical points of a height function 

Let G = 0(n, K) be an orthogonal group embedded in the euclidean space 
E = M(n, K). The euclidean metric is given by (A,B) = 3?Tr(A*B), the 
real part of the trace. I. A. Dynnikov and A. P. Veselov 112 1 1 and Volchenko 
and Kozachko 1 22 ] studied the height functions on G (height with respect 
to some hyperplane), while H. Duan [5.1 studied the distance functions on 
G (distance to a given point). Up to a constant, both classes of functions 
are given by the formula 

h x (A) = ^RTr(XA), 
for some matrix X e E, as it is easy to prove. 

Example 1 When X = diag(Ai, . . . , A„) is a positive real diagonal matrix, 
with A x < • • • < A„, the function h x is a perfect Morse function, whose 
critical points are the diagonal matrices 

diag(ei, . . . ,£„), e k = ±1. 

This result is proven in [5| and [21], see also lH6l . 

Example 2 On the other hand, the case X = 1 1, t e K, was first studied 
by T. Frankel in (8). This time, the height map is a Bott-Morse function, 
invariant by the adjoint action. The set E(ri) of critical points is formed 
by the matrices A such that A 2 = I. 

The two preceding examples are particular cases of the next Theorem, 
that we suppose is more or less folk. It gives a general description of the 
set of critical points of an arbitrary height function hx- 

A direct computation shows that the gradient of h x on G is given by 

{gmdh x )A = ^(X* -AXA). 

Moreover, if A e G is a critical point, then the Hessian operator (HIi x )a- TaG 
TaG is given by 

(Hh x ) A (U) = -^(AXU + UXA), U £ TaG. 

We keep the notation of Example HI 

Theorem 2 Let X £ E = M(n,K) be an arbitrary matrix. Let no be the 
dimension of its kernel and let < t\ < ... < t k be the non-null (real) 
eigenvalues of X X* , with multiplicities m, . . . , n k , that is no+rii + - ■ -\-n k 
r>. Then the set of critical points of the height function hx is 

E(hx) — 0(no,K) x E(m) x • • • x E(rik) (diffeomorphism). 
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Proof First, if Y = UDU*, then S(h Y ) = US(h D )U*. On the other hand, 
if X = US is a polar decomposition (U orthogonal, S hermitian) then 
S(h x ) = S{h s )U*. Finally, if C = diag(-J p , +I q ) then £{h Y ) = CS{h YC )- 
These properties prove that we can restrict ourselves to the case where X 
is given by diagonal blocks 



hi 



o < h < ■ ■ ■ < tk, 



t k lj 



of size n Q ,ni, . . . ,nk- The rest of the proof is a direct computation. 



□ 



The transformations in the proof of Theorem|2]preserve the non-degenerate 
critical points, so we have the following corollary which completely char- 
acterizes the height functions which are Morse functions. 

Corollary 2 The height function h x is Morse if and only if the matrix 
XX* is invertible and has n different eigenvalues. 



3.2 Gradient flow and local model of the critical point set 

Our next Proposition is a generalization of the same result for the classical 
Cayley transform a by Volchenko and Kozachko 11221 . Following the ter- 
minology of these authors we shall call linearization the process of trans- 
forming the gradient flow of hx in G to a flow in the Lie algebra. 

Proposition 4 Let hx be an arbitrary height function onG = 0(n, K) and 
let Abe a critical point. The solution to the gradient equation 

a' = -(X* - aXa) 

passing through a(0) e ^g(A) is the image by the generalized Cayley 
transform ca* of the curve in Ta*G defined as 

(3{t) = exp(-XAt/2) ■ fa ■ exp(-AXt/2), f3 a = c A {a(0)). 

Proof Notice that the matrices XA and AX are symmetric (hermitian) 
because X* = AXA and A*A = id. 

First, from Ae XA = e AX A it follows that A/3 + {A/3)* = 0, that is /? G 

Ta-G. 

Now, from the definition of f3, it is 

= (-1/2) (XAP + 0AX) . (1) 

Let 

a = c A ,o0=(I- Af3){A* + f3)-\ 
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hence 

a{A* + (3)=I-A(3. 

Derivation gives 

a 1 (A* + p) + a/3' = -A/3' 

that is 

a'(A*+j3) = -{A + a)0. 
By Proposition [H the inverse of A* + (3 = A* + c A (a) is (1/2)(A + a), so 

a' = 

(-l/2)(A + a)f3'{A + a) = (2) 
(+1/4) (A + a){XA/3 + f3AX)(A + a). 

Now 

13 = c A {a) = (I- A*a)(A + a)- 1 = (A + a) -1 (I - aA*) 
implies that 

(3(A + a) = I-A*a 

and 

(A + a)(3 = I - aA* 

so from Equation <[2]) 

4a' = 

(A + a)XA{l - A*a) + (I - aA*)AX(A + a) = 

2AXA - 2aXa = 
2{X* -aXa). 

□ 

Remark 2 Indeed, when X = A = I, (3{t) = cxp(— 1)/3 is the radial con- 
traction to p - 

We now show how the Cayley transform serves to give a local chart for 
the set of critical points. This result is completely new. 

Let h x {A) = 5i Tr(XA) be an arbitrary height function on the Lie group 
G = 0(n, K). Let S be the set of critical points of h X - If A 6 S is a critical 
point we denote by S(A) the real vector space 

S(A) = {/3 e Ta'G: XA(3 + (3 AX = 0}. 



Theorem 3 The Cayley map 

c A ,:S(A) Enf2 G (A) 

is a diffeomorphism. 
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Proof We need to prove that the curve 0(t) in Proposition |4] is constant if 
and only if /3 £ S(A). This can be achieved by using Equation (fl}. Then, 
by Proposition SI ca* (0o) is a critical point if and only if (3'(t) = for all t, 
if and only if 

XAfi + p AX = 0. 

□ 

Example 3 Suppose X = I and K = C. Then the critical points of hi are 
the matrices A g U(n) such that A 2 = I. Such a matrix A = A* can be 
diagonalized to D = diag(ei, . . . , e n ), e k = ±1. On the other hand, 0o g TaG 
iff Af3 is skew-symmetric, A/3 = -0qA, while O G S(A) iff Af3 + /3 A = 0. 
It follows that A, = Po- 

So, for instance, the identity / and its opposite — / are critical points 
that are isolated because S(±I) = 0. On the other hand, let A = diag(7 p , — I q ), 
Then /3 &T A G must be of the form 



0o 



V* 
V 



which implies dim S(A) = 2pq. This is in fact the dimension of the (critical) 
orbit of A, which is diffeomorphic to the Grasmannian U(p + q)/(U(p) x 
U(q)). 

Example 4 Let X = diag(g 1; .... q n ) be a diagonal matrix, with q k / 0. 
Assume that \q±\ < . . . < \q n \. This time the gradient condition A* X* = XA 
implies that a critical point has the form 

A = diag(±|gi|/gi, . . . , ±\q n \/q n ). 

Since 

XA = AX = diag(ei|gi|, . . . ,e n \q n \), e k = ±1, 
it follows that S(A) = 0. So all critical points are isolated. 



4 Applications to LS category 

4.1 The unitary group U(n) 

W. Singhof proved in [ 19 1 that the LS category of the special unitary group 
SU(n) is n — 1, hence that of U(n) = S 1 x SU(ii) (diffeomorphism) is n. 
Although he obtained an explicit categorical covering by using the expo- 
nential map, his method has the inconvenience that a logarithm branch 
has to be chosen, hence introducing some technical complexities. 

With the Cayley transform we obtain at each point A g G = U(n) 
a contractible open set ^g(A) diffeomorphic to the Lie algebra u(n) (see 
Theorem [Hi. It is then very easy to find an explicit categorical covering of 
the group by n + 1 open sets. 
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Theorem 4 cat U(n) = n. 



Proof Let X e U(n) be a unitary matrix and let z ■ id £ U{n) be the diago- 
nal matrix diag(z, . . . , z), where z £ C is any complex number with \z\ = 1, 
Remember that X £ J2(z • id) iff the matrix z ■ id + X is invertible. Let 
Ai, . . . , A„ be the eigenvalues of X, then after diagonalizing we obtain: 

z ■ id + X = [/diag(z + X 1 ,...,z + X n )U* , U £ U(n), 

meaning that X £ fl(z ■ id) iff A; ^ — z, Vi = 1, . . . , n. 

Let us take n + 1 different complex numbers z , . . . , z n , with jz^l = 1. 
Let Ak = —Zk ■ id. Since any matrix X e U(n) has at most n different 
eigenvalues there is always some -z k which is not an eigenvalue of X, 
that is X £ fi{A k ). This proves that U(n) = U^ =0 f2 G (A k ), hence cat U(n) < 
n. On the other hand, as it is well known HUH , a lower bound for the LS 
category is given by the length of the cup product. The cohomology of the 
unitary group being 



(see (H p. 273]), the longest non-null cup product is x\ A x 3 A • • • A x 2n -i £ 



4.2 The symmetric spaces U{2n)/ Sp(n) and U{n)/0(n) 
Theorem 5 cat U(2n)/Sp(n) = n. 

The following proof is also a simplification of the original one II 1411 . 

Proof Following Mimura and Sugata, we consider the action U ■ X = 
UXU T of U(2n) on the manifold 



This action turns out to be transitive with isotropy Sp(n). Then U(2n) / Sp(n) = 
M. Here the symplectic group is identified (via complexification) with the 
subgroup of matrices U £ U(2n) such that UJU T = J, where 



H(U (n)) = A(xi,x s , x 2n -i) 



H n , hence n — I.e. p. < cat U(n). Equality follows. 



□ 



M = {X £ U(2n): X + X T = 0}. 




Now, consider the manifold 



M' = JM = {7 6 U(2n):Y T = -JYJ} 



which is diffeomorphic to M because J 2 
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Let Y e M', let A e C with |A| = 1. Then Y e f2(\I) if and only if 
— A is an eigenvalue of Y. But as remarked in II 141 . Yv = —v\ implies 
Y(Jv) = -(Jv)X, which means that Y can be diagonalized as 

Y = U [ D Q n) u *> ^ = dia s(^ ■ ■ ■ AO, 

so 

showing that the maximum number of different eigenvalues of Y is n. 
This implies that when taking n + 1 different complex numbers Zo,...,z n , 
with \z k \ = 1, the open sets Q(-z k I) will cover M' . 

It only remains to show that the Cay ley contraction remains inside M'. 
More explicitly, for any z € C, \z\ = 1, let the Cay ley map be 

c zI :Q(zI)^T^G, 

let Y e fi(zl) n M' and take the radial contraction tc zI (Y), t e [0, 1]. In 
the same way of Proposition [2] it is easy to prove that 

c zI (Y T ) = -c zI (Y) T 

and that 

c zJ (-JYJ) = -Jc zI (Y)J. 

This implies that the image c z i(tc z i(Y) of the contraction by the inverse 
Cayley map is contained in M'. 

Hence cat U(2n)/Sp(n) < n. On the other hand ((151 P- 149]), 

H(U(2n)/Sp(n)) = A(xi,x 5 ,x g , . . .,x in - S ) 

so n = I.e. p. < cat U(2n)/Sp(n). Equality follows. □ 

Theorem 6 cat U(n)/0(ri) = n. 

The proof is completely analogous to the preceding one. 



4.3 The symplectic group Sp(2) 

The LS category of G = Sp(2) was computed for the first time by P. Schweitzer 
lfT8H , who proved that cat Sp{2) = 3. 

By using the Morse theory explained above we shall easily obtain an 
explicit covering by four categorical open sets. 

Let us consider the four critical points of a height function hx as in Ex- 
ample Q] of Section [3l namely: the identity I = diag(l, 1), P = diag(— 1, 1), 
-P and -I. 

Theorem 7 {±Qq(I), ±J7g(P)} is a categorical covering of G = Sp(2). 
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Proof Remember that X e Qg{A) means that A + X is invertible. First 
we see that the open sets i7 G (7) and Qq(—I) cover the whole group ex- 
cepting the orbit UPU* of the matrix P by the adjoint action. In fact, by 
diagonalization, this orbit is formed by the matrices having 1 and — 1 as 
eigenvalues and is diffeomorphic to the sphere S 4 , so it can be covered by 
stereographic projection. 

Explicitly, we must prove that given X = (i y -) = UPU* in the orbit of 
P, either P + X or — P + X is invertible. 

Since P 2 = X 2 = I we have 

(P + X) 2 = 2I + PX + XP = 2diag(l - x n , 1 + x 22 ). 

Then P + X is invertible if and only if xu ^ 1 and x 22 ^ — 1. Suppose 
x\\ = 1. The condition X*X = I means that the columns of X form an 
orthonormal basis of H 2 for the hermitian product (v, w) = v*w. Then, 
xu = 1 implies x\ 2 = x 2 \ = 0. But since X is in the orbit of P, it must 
be x 22 = — 1. Hence X = — P. The same conclusion is obtained from x 22 

1 . So i n fact f2 G (P) covers all the orbit of P, excepting -P. Since —Pe 
S7g(—P), the proof is done. □ 

Remark 3 The cohomology of the symplectic group is |[T5l p. 119] 

H(Sp(n)) = A(x 3 , xt,..., Xin-i) 

so the longest non-null product is x 3 A x 7 A ■ ■ ■ A x 4n _i and I.e. p. Sp(n) = n. 
However Schweitzer IfTHII was able to prove that cat Sp(n) > n+ 1 for n > 2. 



4.4 Left eigenvalues of symplectic matrices 

Obviously it is not worthy to apply the method of critical points above to 
the symplectic group Sp(n), n > 2 (for instance it is known that cat Sp(3) = 5 
(7]|)- Instead, in this section we discuss the possibility of extending the 
eigenvalue method of paragraphs 14. l l and 14.21 to the symplectic setting. 

We shall briefly explain the underlying difficulties. First, it is neces- 
sary to endow the quaternionic space HP with the structure of a right 
H-vector space, in order to obtain the usual results for the matrix asso- 
ciated to a linear map. Second, the theory of right eigenvalues is well 
established, including diagonalization of symplectic matrices Ifll l2ll6ll24l . 
However, for a matrix A e Sp(n) and a quaternion a E H, the condition 
that A — al be invertible is not related to a being a right eigenvalue. In- 
stead we must consider left eigenvalues. 

Definition 2 A quaternion a 6 HI is a left eigenvalue of the matrix A £ 
M{n, H) if and only if there exists v £ HP, v 0, such that Av — av. 
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Unfortunately, very little is known about left eigenvalues of quater- 
nionic matrices. Their existence, number, and methods for computing them 
are only partially understood, see Zhang's paper ['25] for a recent review. 
For n = 2 two of the authors were able to prove the following Theorem, 
based on previous results by Huang and So (9|. 

Theorem 8 ( 1121 ) A symplectic matrix A G Sp(2) has either one, two or 
infinite left eigenvalues. The latter case can only occur when 



As a corollary it is possible to obtain the following deceptive result. 

Corollary 3 ([ 131) Let cr , . . . , er 3 be four arbitrary quaternions of module 
1. Then the categorical open sets Qg{&o ■!),-■■, ^g(°3 • I) do not cover the 
group G = Sp(2). 

Acknowledgements We acknowledge many useful discussions with D. Tanre. 
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